human VR1 gene with translation of open reading frame



Input file Fchrb87a6.seq; Output File Fchrb87a6.tra
Sequence length 3909 

GTGAGCGCAACX3CACTGCGTO 

CGCACTGCX3GGCAGTGAGCGCAACX5CACTG 

OOAGTGAGCGCAACGCACTTGCGGGC^^ 

GCAAOSCACTGCGGGCAGTGAGCGC^ 

GCGGGCAGTGAGC^CAACGCACTGCGGGCAGTGAGCGCAACGCAt^ 
AGCGCAACGCACTCSCGGGCAGTGAGCGCAACGCACTTAATGTGAGTTAGCTC^ 

TTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGA 

ACGCCAAGCrTCTAATACGACTCACTATAGGGAAAGCTGGTACGCCTGCAGGTACC^ 

ACGCGTCCGAAAACACACCTCTCTGCTGTGGGAA 

IissJ 

GTf^^CTGGGGAGAATTCXSTAGATCATCCT 
<XdS&AAAAGCTGTCCACAGTAGTCCCCCCrrTAT^ 

AGkfeTCTCTGCCGTGC^ 

AGfEtACTTAGTCGTCAGATCGCCC^ 

M K K W S S T D L G T A A D P L Q K 18

GCAAGG ATG AAG AAA TGG AGC AGC ACA GAC TTG GGG ACA GCT GCG GAC CCA CTC CAA AAG 54

T C P D P L D 
3AC ACC TGC CCA GAC CCC CTG GAT 

O L P T A K S R 
CAG CTC CCC ACG GCC AAG AGC CGC 

FPVDCPH E 
rPC CCG GTG GAT TGC CCC CAC GAG 

S PVITIQR 
XOC CCT GTT ATC ACC ATC CAG AGG 

Q D 6 V A A S T 
GAC TCT GTC GCC GCC AGC ACC 

FEAVAQNN 
TT GAA GCC GTT GCT CAG AAT AAC 



GDPNS RPPPAKP 38 

GGA GAC CCT AAC TCC AGG CCA CCT CCA GCC AAG CCC 114 

TRLFGKGDSEEA 58 

ACC CGG CTC TTT GGG AAG GGT GAC TCG GAG GAG GCT 174

EGELDSCPTITV 78 

GAA GGT GAG TTG GAC TCC TGC CCG ACC ATC ACA GTC 234 

PGDGPTGARLLS 98

CCA GGA GAC GGC CCC ACC GGT GCC AGG CTG CTG TCC 294 

EK TLRLYDRR SI 118 

GAG AAG ACC CTC AGG CTC TAT GAT CGC AGG AGT ATC 354 

CQDLESLLLFLQ 138

TGC CAG GAT CTG GAG AGC CTG CTG CTC TTC CTG CAG 414 



FIGURE 1 
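The pairing of codon triplets with one-letter amino acid codes in Figure 1 follows the standard genetic code. As a minimal sketch, assuming the standard code and a reading frame that starts at the first base supplied, the following reproduces the translation of the first 18 codons of the human VR1 open reading frame (ATG AAG AAA ... CAA AAG, positions 1 to 54 in the figure's numbering). The function name `translate` is illustrative and not part of the original output.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Whitespace is ignored so that codon-spaced rows can be pasted directly.
    Codons containing unrecognised characters are translated as 'X'.
    A trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


first_row = ("ATG AAG AAA TGG AGC AGC ACA GAC TTG GGG "
             "ACA GCT GCG GAC CCA CTC CAA AAG")
print(translate(first_row))
```

Returning 'X' for unreadable codons is a convenience for checking scanned listings such as this one, where individual bases may be damaged.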



K S K K H L T 
AAG AGC AAG AAG CAC CTC ACA 



L L K A M L N 
CTG CTG AAA GCC ATG CTC AAC 

E I A R Q T D 
SAG ATC GCG CGG CAA ACG GAC 

Y Y K G Q T A 
TAC TAC AAG GGC CAG ACA GCA 

T L L V E N G 
ACC CTC CTG GTG GAG AAC GGA 

K T K G R P G 
AAA ACC AAA GGG CGG CCT GGA 

T N Q L G I V 
ACC^AAC CAG CTG GGC ATC GTG 

SU3 A R D S V G 
AGGlGCC AGG GAC TCG GTG GGC 

T* : =J A D N T K F 
ACGfcGCC GAC AAC ACG AAG TTT 

KM L H P T L K 
AAA CTG CAC CCG ACG CTG AAG 

aK L A A G T G 
GCp| CTG GCA GCT GGG ACC GGG 

E P E C R H 
C^T GAG CCC GAG TGC AGG CAC, 

E SSL Y D L 
CAC TCC TCG CTG TAC GAC CTG 

V I A Y S S S 
GTG ATC GCC TAC AGC AGC AGC 

L N R L L Q D 
CTG AAC GGA CTC CTG CAG GAC 

F L V Y C L Y 
TTC CTG GTC- TAC TGC CTG TAC 

D G Ii P P F K 
XAT GGC TTG CCT CCC TTT AAG 

I L S V L G G 
\TC CTG TCT GTG TTA GGA GGA 



D N E F K D P 
GAC AAC GAG TTC AAA GAC CCT 



L H D G Q N T 
CTG CAC GAC GGA CAG AAC ACC 

S L K E L V N 
AGC CTG AAG GAG CTT GTC AAC 

L H I A I E R 
CTG CAC ATC GCC ATC GAG AGA 

A D V Q A A A 
GCA GAC GTC CAG GCT GCG GCC 

F Y F G E L P 
TTC TAC TTC GGT GAA CTG CCC 

K F L L Q N S 
AAG TTC CTG CTG CAG AAC TCC 

N T V L H A L 
AAC ACG GTG CTG CAC GCC CTG 

V T S M Y N E 
GTG ACG AGC ATG TAC AAT GAG 

L E E L T N K 
CTG GAG GAG CTC ACC AAC AAG 

K I G V L A Y 
AAG ATC GGG GTC TTG GCC TAT 

L S R K F T E 
CTG TCC AGG AAG TTC ACC GAG 

B C I D T C E 
TCC TGC ATC GAC ACC TGC GAG 

E T P N R H D 
GAG ACC OCT AAT OGC CAC GAC 

K W D R F V K 
AAG TGG GAC AGA TTC GTC AAG 

MI I F T M A 
ATG ATC ATC TTC ACC ATG GCT 

M E K I G D Y 
ATG GAA AAA ATT GGA GAC TAT 

V Y F F F R G 
GTC TAC TTC TTT TTC CGA GGG 



E T G K T C 158
GAG ACA GGG AAG ACC TGT 474 



T I P L L L 178 

ACC ATC CCC CTG CTC CTG 534 

A S Y T D S 198 

GCC AGC TAC ACG GAC AGC 594 

R N M A L V 218 

CGC AAC ATG GCC CTG GTG 654 

H G D F F K 238 

CAT GGG GAC TTC TTT AAG 714 

L S L A A C 258 

CTG TCC CTG GCC GCG TGC 774 

W Q T A D I 278 

TGG CAG ^ACG GCC GAC ATC 834 

V E V A D N 298 

GTG GAG GTG GCC GAC AAC 894 

I L M L G A 318 

ATT CTG ATG CTG GGG GCC 954

K G M T P L 338 

AAG GGA ATG ACG CCG CTG 1014 

I L Q R E I 358 

ATT CTC CAG CGG GAG ATC 1074 

W A Y G P V 378 

TGG GCC TAC GGG CCC GTG 1134 

K N S V L E 398

AAG AAC TCG GTG CTG GAG 1194 

M L L V E P 418 

ATG CTC TTG GTG GAG CCG 1254 

R I F Y F N 438 

OGC ATC TTC TAC TTC AAC 1314 

A Y Y R P V 458 

GCC TAC TAC AGG CCC GTG 1374 

F R V T G E 478 

TTC CGA GTT ACT GGA GAG 1434 

I Q Y F L Q 498 

ATT CAG TAT TTC CTG CAG 1494 



FIGURE 1 (cont'd) 



RRPSMKTLFVD 
-GG CGG CCG TCG ATG AAG ACC CTG TTT GTG GAC 

QSLFMLATVVL 
CAQ TCA CTG TTC ATG CTG GCC ACC GTG GTG CTG 

ASMVFSLALGW 
XTT TCC ATG GTA TTC TCC CTG GCC TTG GGC TGG 

FQQMGIYAVMI 
CTC CAG CAG ATG GGC ATC TAT GCC GTC ATG ATA 

RFMFVYIVFLF 
XTT TTC ATG TTT GTC TAC ATC GTC TTC TTG TTC 

IEDGKNDS LPS 
ITT GAA GAC GGG AAG AAT GAC TCC CTG CCG TCT 

PACRPPDSSYN 
:CT GCC TGC AGG CCC CCC GAT AGC TCC TAC AAC 

F ^KFTIGMGDLE 
rrd gVAG TTC ACC ATC GGC ATG GGC GAC CTG GAG 

V |;H p I ILLLAYVI 

rrcMrrc atc atc ctg ctg ctg gcc tat gta att 

I, M= I A L M G E tvnk 
-TO -ATC GCC CTC ATG GGT GAG ACT GTC AAC AAG 

K^LQRAITI LDT 
AQ JCTG CAG AGA GCC ATC ACC ATC CTG GAC ACG 

K FRSGKLLQV 
JUj HGfCC TTC CGC TCA GGC AAG CTG CTG CAG GTG 

yJ;Jr W C F R V D E V N 
AC^CGG TGG TGC TTC AGG GTG GAC GAG GTG AAC 

I INEDPGNC EG 
TC ATC AAC GAA GAC CCG GGC AAC TGT GAG GGC 

RSS RVSGRHWK 
GG TCA AGC AGA GTT TCA GGC AGA CAC TGG AAG 

EASARDRQSAQ 
AG OCA AGT GCT CGA GAT AGG CAG TCT GCT CAG 

SGSLKPEDAEV 
CA GGG TCT CTG AAG CCA GAG GAC GCT GAG GTC 

K * 
AG TGA 



S Y S E M L F F L 518

AGC TAC AGT GAG ATG CTT TTC TTT CTG 1554 

YFSHLKEYV 538 

TAC TTC AGC CAC CTC AAG GAG TAT GTG 1614 

T NMLYYTRG 558 

ACC AAC ATG CTC TAC TAC ACC CGC GGT 1674 

EKMILRDLC 578 

GAG AAG ATG ATC CTG AGA GAC CTG TGC 1734 

GFSTAVVTL 598 

GGG TTT TCC ACA GCG GTG GTG ACG CTG 1794 

ESTSHRWRG 618 

GAG TCC ACG TCG CAC AGG TGG CGG GGG 1854

SLYSTCLEL 638 

AGC CTG TAC TCC ACC TGC CTG GAG CTG 1914 

FTENYDFKA 658

TTC ACT GAG AAC TAT GAC TTC AAG GCT 1974 

LTY ILLLNM 678 

CTC ACC TAC ATC CTC CTG CTC AAC ATG 2034 

IAQESKNIW 698 

ATC GCA CAG GAG AGC AAG AAC ATC TGG 2094 

EKS FLKCMR 718 

GAG AAG AGC TTC CTT AAG TGC ATG AGG 2154 

GYTPDGKDD 738 

GGG TAC ACA CCT GAT GGC AAG GAC GAC 2214 

WTTWNTNVG 758 

TGG ACC ACC TGG AAC ACC AAC GTG GGC 2274 

VKRTLSFSL 778

GTC AAG CGC ACC CTG AGC TTC TCC CTG 2334 

NFALVPLLR 798 

AAC TTT GCC CTG GTC CCC CTT TTA AGA 2394 

PEEVYLRQF 818

CCC GAG GAA GTT TAT CTG CGA CAG TTT 2454 

FKSPAASGE 838 

TTC AAG AGT CCT GCC GCT TCC GGG GAG 2514 

840 
2520 



FIGURE 1 (cont'd) 



GGACGTCACGCAOACAGCACTGTC^ 
TGCTCroTCAGCAGCCTOGCCri^ 

TCTTGGAAGCATGGGGAGTGATGTACATCCAACCGTCACTGTCC^ 
CTCACTTTACTAAAAAAAAAAAAAAAAGGGCGGCCGCTTA 



FIGURE 1 (cont'd) 



Full-length human VR2



Input file Flh21ell.seq; Output File Flh21ell.tra 
Sequence length 2809 

QQCTAOOCIKSTOCTGACAflQQQAfl^ 

iUJAOACCAOAACXTOCXTOCIGaJtfK^ 

GGOU3<XXXnXXXXXXOTCACT^^ 

MTS PSSSP 8 

TGCACAGAGGTCCTGGCTGGACCGAGCAGCCTCCTCCTCCTAGG ATG ACC TCA CCC TCC AGC TCT CCA 24

VFRLETLDGGQEDGSEADRG 26 

GTT TTC AGG TTG GAG ACA TTA GAT GGA GGC CAA GAA GAT GGC TCT GAG GOG GAC AGA GGA 84 

KLDFGSGL P PMES QFQGEDR 48 

AAG CTG GAT TTT GGG AGC GGG CTG CCT CCC ATG GAG TCA CAG TTC CAG GGC GAG GAC CGG 144 

KFAPQIRVNLNYRKGTGASQ 68 

AAA TTC GCC CCT CAG ATA AGA GTC AAC CTC AAC TAC OGA AAG GGA ACA GGT GCC AGT CAG 204 

yP DPNRFDRDRLFNAVSR GVP 88 

COG GAT CCA AAC CGA TTT GAC OGA GAT OGG CTC TTC AAT GOG GTC TCC OGG GGT GTC CCC 264 



:;.^E DLAGLPEYLSKTSKYLTDS 108 

J SAG GAT CTG GCT GGA CTT CCA GAG TAC CTG AGC AAG ACC AGC AAG TAC CTC ACC GAC TOG 324 

l ,sL H YTEGSTG fCTCLMKAVLNLK 128 

§ GAA TAC ACA GAG GGC TCC ACA GGT AAG AOG TGC CTG ATG AAG GCT GTG CTG AAC CTT AAG 384 

[ D G V N A C I Ij P L L Q I D R D S G N P 148 

GAC GGA GTC AAT GCC TGC ATT CTG CCA CTG CTG CAG ATC GAC AGG GAC TCT GGC AAT CCT 444 

I-MQ PZfVNAQCTDDYYRG'HSALH 168 

rJSAG CCC CTG GTA AAT GCC CAG TCC ACA GAT GAC TAT TAC OGA GGC GAC AGC GCT CTG GAC S04 

3 -:^ZAZBKR6Z.QCVKXtIiVE'N6AK 188 

•;3UrC GCC ATT GAG AAG AGG AGT CTG CAG TOT GTG AAG CTC CTG GTG GAG AAT GGG GCC AAT 564 

V H A R A- C_ O . R F F Q KG Q G T C F Y F 208 

GIG CAT GCC OGG GOC TGC GGC OGC TTC TTC GAG AA0 GGC CAA OGG ACT TGC TTT TAT TTC 624 

O B 1. P It S Xm A A C T ' K Q W D V V S Y L 228 

OGT GAG CIA OOC CTC TCT TTG GOC GCT TGC ACC AA0 GAG TOG GAT GTG GTA AGC TAC CTC 684 

Ii BN P HQ P A S Jj QA.T D S Q G NT V 248 

CTG GAG AAC CCA GAC GAG OOC GOC. AGC CTG GAG GOC ACT GAC TCC CAG GGC AAC AGA GTC 744 

ItHA LVMZSDNS AENZAItVTS 268 

CTG GAT GOC CTA GTG ATG ATC %OG GAC AAC TCA GCT GAG AAC ATT GGA CTG GTG ACC AGC 604 

H YDGLLQAGARIiC PTV QLBD. 288 

ASG TAT GAT GGG CTC CTC CAA GCT GGG GOC CSC CTC TGC CCT ACC GTG GAG CTT GAG GAd 664 

Z RRLQD LT P Xi IC Ij A A K E G K Z E 308 



FIGURE 2






ATC OGC AAC CTO CAD OAT CTC AOO^^ CTO AAG CTO GCC GCC AAG GAG GC^^lG ATC GAG 924 

I F R H I L Q R B F S G it S H L S R K *" F 326 
ATT TTC AGO CAC ATC CTO GAG OGG GAG TTT TCA GGA CTO AGO CAC CTT TCC CQA AAG TTC 964 

T E HCY G P V R V £ L Y DLAS V D S 348 

ACC GAG TOG TOC TAT GGG CCT GTC OGG GTO TOG CTO TAT GAC CTO GOT TCT GTO GAC AGO 1044 

CEE NSVLE I IAFHCKSPHRH 366 

TOT GAG GAG AAC TCA GTO CTO GAG ATC ATT GCC TTT CAT TGC AAG AGC COG CAC OQA CAC 1104 

RMVVLEP LKKLLQAKWDLLI 388 

OGA ATO GTC GTT TTG GAG CCC CTG AAC AAA CTO CTO CAG GOG AAA TOG GAT CTO CTC ATC 1164 

P KFFLN F L C N L I YM F I F T A V 408 

CCC AAG TTC TTC TTA AAC TTC CTO TOT AAT CTG ATC TAC ATO TTC ATC TTC ACC GCT GTT 1224 

AYHQPTL K KQAA PHL KAEVG 428 

GCC TAC CAT CAG CCT ACC CTG AAG AAG CAG <5CC GCC CCT CAC CTG AAA GCG GAG GTT GGA 1284 

NSMLLTG H I L I L LGG I Y LL V 448 

AAC TCC ATG CTG CTG ACG GGC CAC ATC CTT ATC CTG CTA GGG GGG ATC TAC CTC CTC GTG 1344 

GQLWYFWRRHVFIWISFIDS 468 

GGC CAG CTO TGG TAC TTC TGG OGG OGC CAC GTG TTC ATC TOG ATC TOG TTC^ATA GAC AGC 1404 

l i Y FE I It F L F QAIi L T VV. S QVLC 488 

JHtAC TTT GAA ATC CTC TTC CTO TTC CAG GCC CTG CTC ACA GTO GTG TOC CAG GTG CTG TGT 1464 

ffiFLAIEWY LPLLVSALVLGWL 508 

.TTC CTO GCC ATC GAG TGG TAC CTG CCC CTG CTT GTO TCT GOG CTG GTO CTG GGC TGG CTG 1524 

JTn IjLYYTRG FQHTG IYSVMIQ 528 

fAAC CIO CTT TAC TAT ACA OGT GGC TTC CAG CAC ACA GGC ATC TAC AGT GTC ATG ATC CAG 1584 

"KVXLRDLXiRFIiIiXYLV.FXiFG 548 

i:3WWS GTC ATC CTG OGG GAC CTO CTO OGC TTC CTT CTG ATC TAC TTA GTC TTC CTT TTC GGC 1644 

IL'-F A V A L V S IiSQEAWRP E APTG 568 

J HCTC GOT GTA GCC CTO GTO AGC CTO AGC CAG GAG GCT TGG OGC CCC GAA GCT OCT ACA GGC 1704 

OP MATESVQ FMEGQEDEGNGA 588 

1 3DOC AAT GCC ACA GAG TCA GTO GAG CCC ATG GAG GGA GAG GAG GAC GAG GGC AAC GGG GCC 1764 

Q Y ROIL B A S I/E I. F K F T I G . M G 608 

GAG TAC AGG OGT ATC GIG GAA GCC TOC TXO GAG CTC TTC AAA TTC AOC ATC GGC ATG GGC 1824 

B Ii A F Q B Q It H F R G M V L I* X* V L A 628 

GAG CTG GCC TTC GAG GAG GAG CTG GAC TTC OGC GGC ATG GTG CTG CTG CTG GTG CTO GCC 1684 

TVLL TY X It L L N M L XAIiHS ET 648 

TAC GTO CTG CTC AOC TAC ATC CTG CTG GTC AAC ATG CTC ATC GCC CTC ATG AGC GAG ACC 1944 

VHSVATDS H e IHICLQK A ISV 668 

GTC AAC AGT GTC GCC ACT GAC AGC TGG AGC ATC TOG AAG CtG CAG AAA GCC ATC TCT GTC 2004 

. It B M E H G YW W G R K KQ R AG VMt. 688 

CTO GAG ATS GAG AAT OGC TAT TGG TOG TOC AGG AAG AAG GAG GGG GGA GOT GTO ATG CTG 2064 

TVGTK PDGS PDERWCFRVEE 708 



FIGURE 2 (Continued)






AOC OTT OOC ACT AAO CCA GAT GOd^^b COO GAT GAG OOC TOO TOC TTC Ad^Pttt GAG GAG 2124 

VNWASWEQTLPTLCEDPSg' **A 728 

GTO AAC TGO GCT TCA TGO GAG CAG AOG CTG CCT AGO CTG TGT GAG GAC COO TCA GGG OCA 2184 

GVPRTL EN P V L A S P P KEDED 748 

GOT GTC OCT OOA ACT CTC GAG AAC OCT GTC CTG GCT TCC CCT CCC AAG GAG GAT GAG GAT 2244 

GASEENYVPVQLLQSN* 765 

GOT GCC TCT GAG GAA AAC TAT GTO CCC GTC CAG CTC CTC CAG TCC AAC TGA 2295 

TGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCT 

AATTCTGGTOGCAAATATATATTTTCAQTAACTCA 



FIGURE 2 (Continued)



Partial human VR2 alte form 

Input file frhobl2c4.seq; Output File frhobl2c4.tra
Sequence length 1469 

ORFFQ KOQGTCFYFGEI/PL 
OC OOC OGC TTC TTC CAO AAG OOC CAA OOG ACT TGC TTT TAT TTC GOT GAG CTA OCC CTC 

SLAACTKQWDVVSYLLENPH 
TCT TTG OCC GCT TGC ACC AAG CAG TGG GAT GTG GTA AGC TAG CTC CTG GAG AAC CCA CAC 

QPAS LQATDSQGNTVLHALV 
CAG CCC GCC AGC CTG CAG GCC ACT GAC TCC CAG GGC AAC ACA GTC CTG CAT GCC CTA GTG 

M 1 SDNS A E N IALVTS MYDGL 
ATG ATC TOG GAC AAC TCA GCT GAG AAC ATT GCA CTG GTG ACC AGC ATG TAT GAT GGG CTC 

LQAGARLC PTVQIiEDIRNLQ 
CTC CAA GCT GGG GCC CGC CTC TGC CCT ACC GTG CAG CTT GAG GAC ATC CGC AAC CTG CAG 

DLT PLKLAAK.EGKIEIFRHI 
GAT CTC ACG CCT CTG AAG CTG GCC GCC AAG GAG GGC AAG ATC GAG ATT TTC AGG CAC ATC 

I*OREFSGIjSHIjSRKFTEWCY 
CTG CAG GGG GAG TTT TCA GGA CTG AGC CAC CTT TCC OGA AAG TTC ACC GAG TGG TGC TAT 

■g G P V R V S L Y D L A SVDSCEENS 
J - GGG CCT GTC OGG GTG TOG CTG TAT GAC CTG GCT TCT GTG GAC AGC TGT GAG GAG AAC TCA 

IPVLEI IAFHCKS PHRHRMVVL 
VJGTG CTG GAG ATC ATT GCC TTT CAT TGC AAG AGC COG CAC OGA CAC OGA ATG GTC GTT TTG 

j^EPI.NKIiLQ AKKDLI.1 PKFFL 
; OCC CTG AAC AAA CTG CTG CAG GOG AAA TGG GAT CTG CTC ATC CCC AAG TTC TTC TTA 

L N F L c N I* I Y M F I F T A V A Y H Q P 
' ^ PkC 1X0 010 TGT AAT 0X0 ATC TAC ATO 7X0 ATC TTC ACC GCT GTT GCC TAC CAT CAG CCT 

q T L.K K Q A A P H L K A E V G N S MI. L 
jiyAOC CTG AAG AAG GAG GCC GCC OCT CAC CTG AAA GOG GAG GTT GGA AAC TCC ATG CTG CTG 

hJ. TOHZXi.II.IiOGZYIiIiVG-QI.H-Y 
ijAOG OGC GAC ATC CTT ATC CTG CTA OGG OOG ATC TAC CTC CTC GTG GGC GAG CTG TGG TAC 

~F- H R R H V F Z .H Z S F Z D 8 Y F E Z i 

TTO TOG OGG OGC CAC GTG TTC ATC TOO ATC TOG TTC ATA GAC AGC TAC TTT OAA ATC CTC 

P LFQALItTVVSQV IiCFZi AZ E 
TTC CIO TTC GAG OOC CTG CTC ACA GTG GTG TCC GAG GTG CTG TOT TTC CTG OOC ATC GAG 

HYLPLL VSAIiVIiGHIiNLL YY 
TOO TAC CTG OOC CTG CTT GTG TCT GOO CTG GTG CTG GGC TOG CTG AAC CTG CTT TAC TAT 

TRGFQH TG I Y6VMZQKK AIS 
ACA GOT OGC TTC CAG GAC ACA OOC ATC TAC AGT GTC ATG ATC GAG AAG AAA GCC ATC TCT 

V. I. B M E N G Y H H C R K K Q RA GV H 
GTC CTG GAG ATG GAG AAT GGC TAT TOG TOO TGC AGO AAG AAG GAG OGG GGA GOT GTG ATG 



Amino acid / nucleotide end positions for the 18 rows above, in order: 19/57, 39/117, 59/177, 79/237, 99/297, 119/357, 139/417, 159/477, 179/537, 199/597, 219/657, 239/717, 259/777, 279/837, 299/897, 319/957, 339/1017, 359/1077.



FIGURE 3



LTVGTKPDGSPDERWCFRVE 379

CTG ACC GTT GGC ACT AAG CCA GAT GGC AGC CCG GAT GAG CGC TGG TGC TTC AGG GTG GAG 1137



EVNWASWEQTLPTLCEDPSG 399

GAG GTG AAC TGG GCT TCA TGG GAG CAG ACG CTG CCT ACG CTG TGT GAG GAC CCG TCA GGG 1197

AGVPRTLENPVLASPPKEDE 419 

GCA GGT GTC CCT CGA ACT CTC GAG AAC CCT GTC CTG GCT TCC CCT CCC AAG GAG GAT GAG 1257

DGASEENYVPVQLLQSN* 437 

GAT GGT GCC TCT GAG GAA AAC TAT GTG CCC GTC CAG CTC CTC CAG TCC AAC TGA 1311 

TGG<XXIAGATGCAGCAGGAGGCCAGLAGGACAG 

AATTCTGGTGGCAAATATATATTTTCACTAACT^^ 

GCGGAOGCGTGGGTOGAC 



FIGURE 3 (continued)



Partial rat VR2 



Input file Flrxbl47gli.seq; Output File Flrxbl47gll.tra
Sequence length 1794 

S T H A S A L S IiAACTKQWDVV 19 

G TOO ACC CAC OOO TOC OCT CTT TCT CTG OCT GOG TOO ACC AAG GAG TOG GAT GTG GTG 57 

TYLLEKPHQPA S LEATDSLG 39 

ACC TAC CTC CTG GAG AAC CCA CAC CAG COG GCC AGC CTG GAG GCC ACC GAC TCC CTG GGC 117 

NTVtfHALVM IADNS P EK SAL 59 

AAC ACA GTC CTG CAT OCT CTG GTA ATG ATT GCA GAT AAC TOG OCT GAG AAC AGT GCC CTG 177 

VIHMYDG LLQMGARLCPTVQ 79 

GTG ATC CAC ATG TAC GAC GGG CTT CTA CAA ATG GGG GOG OGC CTC TGC CCC ACT GTG CAG 237 

LEEISNHQGLTPLKLAAKEG 99 

CTT GAG GAA ATC TCC AAC CAC CAA GGC CTC ACA CCC CTG AAA CTA GCC GCC AAG GAA GGC 297 

KIEIFRH I LQREFSGPYQPL 119 

AAA ATC GAG ATT TTC AGG CAC ATT CTG CAG CGG GAA TTC TCA GGA CCG TAC CAG CCC CTT 357 

SRKFTEWCYGPVRVSLYDLS 139 

TOC OGA AAG TTT ACT GAG TGG TGT TAC GGT OCT GTG OGG GTA TOG CTG TAC^GAC CTG TCC 417 

'ZS VDSWEKNSVLEI IAFHCKS 159 

]*tCT GTG GAC AGC TGG GAA AAG AAC TOG GTG CTG GAG ATC ATC GOT TTT CAT TGC AAG AGC 477 

MP NRHRMV VLEPLNKLLQE KW 179 

gCG AAC OGG CAC OGC ATG GTG GTT TTA GAA CCA CTG AAC AAG CTT CTG CAG GAG AAA TGG 537 

I" D R fcVSRFFFNFACYLVYMFI 199 

j qAT OGG CTC GTC TCA AGA TTC TTC TTC AAC TTC GCC TGC TAC TTG GTC TAC ATG TTC ATC 597 

5! P. T . V V A Y H Q P S L D 0 P A I P S S K 219 

?!!fTC ACC GTC GTT GCC TAC CAC CAG CCT TCC CTG GAT CAG CCA GCC ATC CCC TCA TCA AAA 657 

]L"A TFGESMIfXiLGHXXi.IIfXiGGX 239 

•<GOG ACT TTT GGG GAA TOC ATG CTG CTG CTG GGC CAC ATT CTG* ATC CTG CTT GGG GGT ATT 717 

Q.Y liliXiGQIiKTFHRRRXiFXHXS 259 

i;SAC CTC TTA CTG GGC GAG CTG TGG TAC TTT TGG CGG OGG OGC CTG TTT ATC TGG ATC TCA 777 

F M T> S ' X F * E I If F If 1. ' Q A L I. T V L S 279 

TTC AXG GAC AGC TAC TTT GAA ATC CTC TTT CTC CTT GAG GOT CTG CTC ACA GTG CTG TOC 837 

QVLRFHBTBKYLPI* !■ VI.SLV 299 

GAG GTG CTG OGC TTC ATG GAG ACT GAA TGG TAC CTA OCC CTO CTA GTG TTA. TOC CTA GTG 897 

LGtfX.HX.XiYYTRGFQKTGXYS 319 

CTG GGC TOG CIO AAC CTG CTT TAC TAC ACA OGG GGC TTT GAG. CAC ACA GGC ATC TAC AGT 9S7 

VMXQKVZ'X.RDI.XiRFXeX.V.TXiV 339 

OXC AXG ATC GAG AAG GTC ATC CTT OSA GAC CTG CTC OGT TTC CTG CTG GTC TAC CTG GTC 1017 

F I.F G FAVA Xi VG L 6 R E A R S P K 359 

TTC CTT TTC GGC TTT OCT GTA GCC CTA GTA AGC TTG AGC AGA GAG GCC OGA AGT CCC AAA 1077 



A P E D N N S T^^P T E O P T V G ^^P E E E 379 

OCX: OCT GAA GAT AAC AAC TCC ACf^TG ACQ GAA CAO CCC ACO GTO GOC CA^OAO GAG. GAG 1137 

P A P Y R S ILDASliELFKFTlG 399 

OCA OCT OCA TAT OGG AGO ATT CTG GAT GCC TCC CTA GAG CTG TTC AAG TTC AOC ATT GGT 1197 

HGELAFQEQItRFRGVVXfLLL 419 

ATG OGG GAG CTG GCT TTC GAG GAA GAG CTG OGT TTT CGT GGG GTG GTC CTG CTG TTG CTO 1257 

LAYVLIiTYVLLLNML IALMS 439 

TTG GCC TAC GTC CTT CTC ACC TAC GTC CTG CTG CTC AAC ATG CTC ATT GCT CTC ATG AGC 1317 

ETVNHVADNSWS IWKLQKAl 459 

GAA ACT GTC AAC CAC GTT GCT .GAC AAC AGC TGG AGC ATC TGG AAG TTG GAG AAA GCC ATC 1377 

SVLE MENGYWWCRRKKHREG 479 

TCT GTC TTG GAG ATG GAG AAT GGT TAC TGG TGG TGC OGG AGG AAG AAA CAT OGT GAA GGG 1437 

RLLKVGTRGDGTPDERWCFR 499 

AGG CTG CTG AAA GTC GGC ACC AGG GGG GAT GGT ACC CCT GAT GAG CGC TGG TGC TTC AGG 1497

VEEVNWAAWEKTLPTLSEDP 519 

GTG GAG GAA GTA AAT TGG GCT GCT TGG GAG AAG ACT CTT CCC ACC TTA TCT GAG GAT CCA 1557 

S O PG I TGN K KN P T S K P G^ K N S 539 

TCA GGG OCA GGC ATC ACT GGT AAT AAA AAG AAC CCA ACC TCT AAA COG GGG AAG AAC AGT 1617 

A.S E E D H L P L Q V I* Q S P * 555 

GOC TCA QAG GAA GAC CAT CTG CCC CTT GAG GTC CTC CAG TCC CCC TGA 1665 

TGGCCCAGATGTCAGGAGCAGGCTGGGAGGATXTC 
GAAMATAAM ^^ 



FIGURE 4 (continued)



GAP of: humanvr2.pep check: 5746 from: 1 to: 764
humanVR2 Flh21ell

to: humanvr1.pep check: 6877 from: 1 to: 839

humanVR1_Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum.

Symbol comparison table:
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/blosum62.cmp
CompCheck: 6430

Gap Weight: 12 Average Match: 2.912

Length Weight: 4 Average Mismatch: -2.003

Quality: 1530 Length: 850

Ratio: 2.003 Gaps: 10

Percent Similarity: 55.378 Percent Identity: 46.348

Match display thresholds for the alignment(s):
| = IDENTITY
: = 2
. = 1
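The Ratio value in a GAP header can be cross-checked against the other reported values. The sketch below assumes the GCG convention that the ratio is the alignment quality divided by the length of the shorter sequence; the function name `gap_ratio` is illustrative and not part of GCG. The inputs are taken from the two GAP headers in this document: the peptide comparison (quality 1530, lengths 764 and 839) and the nucleotide comparison (quality 14359, lengths 2809 and 3909).

```python
def gap_ratio(quality: float, len_a: int, len_b: int) -> float:
    """Quality normalised by the shorter sequence length.

    Assumption: this mirrors how GCG GAP derives its 'Ratio' line.
    """
    shorter = min(len_a, len_b)
    if shorter <= 0:
        raise ValueError("sequence lengths must be positive")
    return quality / shorter


# Peptide comparison: humanvr2.pep (764 aa) against humanvr1.pep (839 aa)
peptide_ratio = round(gap_ratio(1530, 764, 839), 3)

# Nucleotide comparison: humanvr2.seq (2809 nt) against humanvr1.seq (3909 nt)
nucleotide_ratio = round(gap_ratio(14359, 2809, 3909), 3)

print(peptide_ratio, nucleotide_ratio)
```

Under this assumption both computed values agree with the Ratio lines printed in the headers (2.003 and 5.112), which also supports reading the peptide range as 1 to 764 and the nucleotide quality as 14359.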



humanvr2.pep x humanvr1.pep

1 . MTSPSSSPVF 10 

I . I - . I 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

11 RLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 60 

: : I . I I : . : I I . 

51 GKGDSEEAFPVDCPHEEGELDSCPTI . TVSPVTTIQRPGDGPTGARLLSQ 99 

61 RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEY 110 
. ' . : : I I : I I I . . : I I I : I I . I : I I I . I : 

100 DSVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLrLQKSKKHLTDNEF 14 9 

• • • • * 

111 TEGSTGKTCIMCAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDY 160 

: I I I I I I: I I - I I I I I I I I I : I ! ... I I I I III 
150 KDPETGKTCLLKAMLNLH DGQNTT I PLLLE I ARQT DS LKELVNAS YT DS Y 199 

161 YRGHSAUilAIEKRSLQCVTCLLVENGANVHARACGRFFQKGQG . TCFYFG 209 

1:1.1111111:1.: I I I I I I I I , I I I I I I . I . I Mil 
200 YKGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFG 249 

210 ELPLSLAACTKQWDVVS YLLENPHQPASLQATDSQGHTVLHALVMI S DNS 259 

I I I I I I I I I I I : I : I I : I I I : I II I I I I I I I I I : . I I . 
250 ELPLSIAACTNQLGIVKFLLQNSWQTADISARDSVGNTVXHALVEVADNT 299 

260 AENIALVTSMYTCIJjQAGARIjCPTVQLEDIRNLQ 309 

1:1 Mill. :l 11:1 ll-.M:: I . MM III III : 
300 AONTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGV 349 



FIGURE 5 



* • • • • 

310 FRHILQREFS. .GLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEI 357 

: I I I I I I I I I I I I I I I I I I Mill. : I . I I . I I I II : 

350 LA YI LQRE I QE PECRHLS RKFTEWA YG P VHSS L YDLSC I DTCEKNS VLEV 399 

358 IAF.HCKSPHRHRMWLEPLNKLLQAKWDLLIPK, FFLNFLCNLIYMFIF 405 

II: • • I - M I ... I I I I : I I I I I I : : I : I I I : I I I I 
4 00 IAYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRI FYFNFLVYCLYMI I F 4 49 

4 06 TAVAYHQPTLKKQAAPHLKAEVGNSMLLTGH I L I LLGGI YLLVGQLWYFW 4 55 

I I I : • I .J . : I . . I I I I . I I I : I : II 

450 TMAAYYRPV. . DGLPPFKMEKIGDYFRVTGEILSVLGGVYFFFRGIQYFL 497 

456 RRHVFI WI S FI DS YFE I LFLFQALLT VVSQVLCFLAIEWYLPLLVSALVL 505 

.1 . I : I I I I . I I I.I ..Ml : . I . : I . I I 
4 98 QRRPSMKTLFVDSYSEMLFFLQSLFMLATWLYFSHLKEYVASMVFSLAL 54 7 

506 GWLNLLYYTRGFQHTGI YSVMIQKVILRDLLRFLLI YLVFLFGFAVALVS 555 

II I : I I I I I I I I I I I . I I I : I . II I I I II: : I : I I I I I I - I . I - 

54 8 GWTNMLYYTRGFQQMGI YAVMIEKMILRDLCRFMFVYIVFLFGFSTAVVT 597 

556 LSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLELFKFTIj: 605 

I : : I . • I I . I : I I I I I I I I 

598 LIEDGKNDSLPSESTSHRWRGPACRPPD SSYNS^YSTCLELFKFTI 64 3 

606 GMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATD 655 

111:1 I I I : . : : I I I I I I : I I I I II I I II I I I I I I I I : I : 
64 4 GMGDLEFTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQE 693 

656 SWSIWKLQKAISVLEMENGYWWC . RKKQRAGVMLTVGTKPDGSPDERWCF 704 

I . I I I I I : I I . : I : I : III I . I : I I I III I I I I I 
694 SKNI WKLQRAI T I LDTEKS FLKCMRKAFRSGKLLQVG YT PDGKDDYRWC F 74 3 

705 RVEEVNWASWEQTLPTLCEDPSGA. GVPRTLENPVLASPPKEDEDGASEE 753 

11:1111.1 . : III I I I I I • • I I 

744 RVDEVNWTTWNTNVGIINEDPGNCEGVKRTLSFSLRSS. . • .RVSGRHWK 789 

754 NYVPVQLLQSN • 764 

I: I I I. 

790 NFALVPLLREASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 



FIGURE 5 (cont'd) 



GAP of: humanvr2.seq check: 8853 from: 1 to: 2809
humanVR2 21ella, 2809 bases, 8853 checksum.

to: humanvr1.seq check: 4554 from: 1 to: 3909

humanVR1 Fbhl8547pat - Import - complete

Symbol comparison table:
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp
CompCheck: 8760

Gap Weight: 50 Average Match: 10.000

Length Weight: 3 Average Mismatch: 0.000

Quality: 14359 Length: 3934

Ratio: 5.112 Gaps: 15

Percent Similarity: 55.316 Percent Identity: 55.316

Match display thresholds for the alignment(s):
| = IDENTITY
: = 5
. = 1
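In this nucleotide comparison the header reports the same value for Percent Similarity and Percent Identity, which is what one would expect when only identical bases receive a positive score (Average Match 10.000, Average Mismatch 0.000). As a rough sketch of how a percent identity figure can be computed from two gapped alignment rows, the function below counts identical positions over the columns in which neither row has a gap. The choice of denominator is an assumption for illustration and may differ from GCG's exact definition; `percent_identity` is a hypothetical helper name.

```python
def percent_identity(row_a: str, row_b: str, gap_chars: str = ".-") -> float:
    """Percent of identical symbols over columns where neither row is a gap.

    Both rows must come from the same alignment and have equal length.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    compared = 0
    identical = 0
    for x, y in zip(row_a.upper(), row_b.upper()):
        if x in gap_chars or y in gap_chars:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


# Toy rows (not taken from the figure): five ungapped columns, four identical.
print(percent_identity("ACGT.A", "ACCTGA"))
```

The dot is treated as a gap character because the alignment rows in this output use runs of dots to mark gaps.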

humanvr2.seq x humanvr1.seq



1 GGCTAGCCTGTCCTGACAGGGGAGAG 26 

I I I II I I I I I 
801 TGTCCACAGTAGTCCCCCCTTATCCACGGGTGTCACTTTCCATGGGTTCA 850 
• . * • • 

27 TTAAGCTCCCGTTCTCCACCGTGCCGGCTGGCCAGGTGGGCTGAGGGTGA 76 
I I I I I I I I I I I I I I I I II 

851 GTTATTTGCGGTCAACCACGGTCTGCCAATATTAAATGGAAJ\ATTCTTCA 900 

77 CCGAGAGACCAGAACCTGCTTGCTGGAGCTTAGTGCTCAGAGCTGGGGAG 126 
II Ml III III I II I I I I III 

901 AACAGTTCCCAAGTTTTCCCTTGTGCATTGTTCTGAGCAGTGTGATGAAG 950 



127 GGAGGTTCCGCCGCTCCTCTGCTGTCAGCGCCGGCAGCCCCTCCCGGCTT 176 

I II II I II I I I I I I I 

951 AGTCTCTGCCGTGCCATCTGGGATGCAAACCGTCCCTGTGTCCCCCACGT 1000 

• * • • ■ 

177 CACTTCCTCCCGCAGCCCCTGCTACTGAGAAGCTCCGGGATCCCAGCAGC 226 
I II I I I I I III III I I I I I 

1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

• • • • • 
227 CGCCACGCCCTGGC CTCAGCCTGCGG6 253 

"•III I I J II III I I 

1051 CGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 



FIGURE 6 



254 GCTCCAGTCAGGCCAACflBCGACGCGCAGCTGGGAGGAAG ^2 9 3 

I Mill HI I I Mill II I 

1101 GATCCAGCAAGGATGAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC 1150 

• • • • 

294 . ACAGGACCCTTGACATCTCCATCTGCACAGAGGTCCTG 331 

I I I I I i I I I I II I I I I III Ml 

1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 

• • • • , 

332 GCTGGACCGAGCAGCCTCCTCCTCCTAGGATGACCTCACCCTCCAGC . . T 37 9 
Ml I II I II I I II I II III 

1201 ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 1250 

• . . • • • . . 
380 CTCCAGTTTTCAGGTTGGAGACATTAGATGGAGGCCAAGAAGATGGCTCT 429 

II I I I II II III I M II I I I ! I I 

1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

4 30 GAGGCGGACAGAGGAAAGCTGGATTTTGGGAGCGGGCTGCCTCCCATGGA 4 79 
I III I I I I I I I I I 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 1350 

4 80 GTCACAGTTCCAGGGCGAGGACCGGAAATTCGCCCCTCAGATAAGAGTCA 52 9 
I I III II I I I I II I 'III II I 

1351 CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGG. 1398 

530 ACCTCAACTACCGAAAGGGAACAGGTGCCAGTCAGCCGGATCCAAACCGA 579 
II II I II II II I I I I 

1399 TGCTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTC 14 4 8 

580 TTTGACCGAGATCGGCTCTTCAATGCGGTCTCCCGGGGTGTCCCCGAGGA 629 

I II I I I I II I I I II II III I I I I II I 

144 9 TATGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGA 14 98 

630 TCTGGCTGGACTTGCAGAGTACCTGAGCAAGACCAGCAAGTACCTCACCG 67 9 

II II I till 1 I I II I II I I I II I I I II II I I 

14 99 TCTGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAG 1548 



680 ACTCGGAATACACAGAGGGCTCCACAGGTAAGACGTGCCTGATGAAGGCT 729 
II I I I II II I I I II I II II I II II I I II I I I 

1549 ACAACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCC 1598 

. 730 GTGCTGAACCTTAAGGACGGAGTCAATGCCTGCATTCTGCCACTGCTGCA 779 

I M I M I II I I M II I I I II II I I I I I I I I I 
1599 ATGCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGA 1648 

• • • • m 

780 GATCGACAGGGACTCTGGCAATCCTCAGCCCCTGGTAAATGCCCAGTGCA 829 
Mill III I I I I I II II I I II I I I III 
1649 GATCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCCAGCTACA 1698 

830 CAGATGACTATTACCGAGGCCACAGCGCTCTGCACATCGCCATTGAGAAG 879 
I M I I I I I I I II I I I I I I I I I II II I I I I I I I I I I 
1699 OGGACAGCTACTACiUlGGGCCAGACAGCACTGCACATCGCCATCGAGAGA .1748 



880 AGGAGTCTGCAGTGTGTGAAGCTCCTGGTGGAGAATGGGGCCAATGTGCA 929 
II II I I I I I I I I I I II II I I II II II I 1 I II 

1749 CGCAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCA 1798 

• • • a • 

930 TGCCCGGGCCTGCGGCCGCTTCTTCCAGAAGGGCCAAG. . . GGACTTGCT 976 
II I I I I II I I I I I I I I I I I III I I I I I I 
1799 GGCTGOGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGAT 1848 



FIGURE 6 (cont'd) 



977 TTTATTTCGGTGAGCTACCCCTCTCTTTGGCCX5CTTGCACCAAGCAGTGG 1026 
I II I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I III I 
184 9 TCTACTTCGGf GAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTG 1898 

1027 GATGTGGTAAGCTACCTCCTGGAGAACCCACACCAGCCCGCCAGCCTGCA 1076 

I I I I I I I I I I I I I I I I I I I I I I I II II 

1899 GGCATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAG 194 8 

1077 GGCCACTGACTCCCAGGGCAACACAGTCCTGCATGCCCTAGTGATGATCT 1126 

I I I I I I I I I . I I I I I I I I I I I I I I M I I I I I II I II 
194 9 CGCCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGG 1998 

1127 CGG ACAAC T C AGCTG AG AAC AT T GC ACTGG TG ACC AGC ATGTATG ATGGG 117 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1999 CCGACAACACGGCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAG 204 8 

117 7 CTCCTCCAAGCTGGGGCCCGCCTCTGCCCTACCGTGCAGCTTGAGGACAT 122 6 

I I I I I I I I I II I I I II I I I I I I I I I II I 

204 9 ATTCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCT 2098 

1227 CCGCAACCTGCAGGATCTCACGCCTCTGAAGCTGGCCGCCAAGGAGGGCA- 1276 

I I I I I I I I I I I I I I I I I I I I I I I I I I III 
2099 CACCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGA 214 8 

1277 AGATCGAGAT T T TCAGGCACATCCTGCAGCGGG AGT T TTCAGGA 1320 

I I I II f I I I I I I I I I I I I I I I I I I I II 

214 9 AGATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAG 2198 

1321 CTGAGCCACCTTTCCCGAAAGTTCACCGAGTGGTGCTATGGGCCTGTCCG 1370 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
2199 TGCAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCA 2248 

1371 GGTGTCGCTGTATGACCTGGCTTCTGTGGACAGCTGTGAGGAGAACTCAG 1420 

I I I I I I I I I I I I I I I I I I I I I III III I I I I I I I I 
2249 CTCCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAG7UVCTCGG 2298 

1421 TGCTGGAGATCATTGCCTTTCATTGCA. . . AGAGCCCGCACCGACACCGA 14 67 

I I I I I I I I I I I I I I I III I I I I I I I I I I I I 

2299 TGCTGGAGGTGATCGCCTACAGCAGGAGCGAGACCCCTAATCGCCACGAC 2348 

1468 ATGGTCGTTTTGGAGCCCCTGAACAAACTGCTGCAGGCGAAATGGGA. • . 1514 

III II I I I I I I I I I I I I I I III I I I I I I I II I I I I I 
2349 ATGCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAG 2398 

1515 TCTGCTCATCCCCAAGTTCTTCTTAAACTTCCTGTGTAATCTGATCTACA 1564 

I III III I I I I I I I I I I I I I I I I I I I I I I 

2399 ATTCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACA 24 48 

1565- TGTTCATCTTCACCGCTGTTGCCTACCATCAGCCTACCCTGAAGAAGCAG 1614 

II I I I I I I I I I I I I I I I I I I I I I III II 

2449 TGATCATCTTCACCATGGCTGCCTACTA CAGGCCCGTGGATGGCTT 2494 

• • • • • 

1615 GCCGCCCCTCACCTGAAAGCGGAGGTTGGAAACTCCATGCTGCTGACGGG 1664 

I I I I I I I I II I I I I I I I I I I II I I I I I 
2495 GCCTCCCTTTA. . AGATGGAAAAAATTGGAGACTATTTCCGAGTTACTGG 2542 



FIGURE 6 (cont'd) 



1665 CCACATCCTTATCCTGCT AGGGGGG ATCTACCTCCTCGTGGGCCAGCTGT 1714 

I I M I I I I I I I I I I I I I I I I I I I I If 

254 3 AGAGATCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTC 2592 

• • > ■ • 
1715 GGTACTTCTGGCGGCGCCACGTGTTCATCTGGATCTCGTTCATAGACAGC 17 64 

III III I I I I I || II III III I I I I I I I 

2593 AGTATTTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGC 264 2 

• • * • • 

17 65 TACTTTGAAATCCTCTTCCTGTTCCAGGCCCTGCTCACAGTGGTGTCCCA 1814 

III III II II. Ml I I I I I I I I I I I I I I f II 
264 3 TACAGTGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGT 2692 

1815 GGTGCTGTGTTTCCTGGCCATCGAGTGGTACCTGCCCCTGCTTGTGTCTG 18 64 
MINIM Ml I I I I I III III I I I I 

2 693 GGTGCTGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCT 27 4 2 

18 65 CGCTGGTGCTGGGCTGGCTGAACCTGCTTTACTATACACGTGGCTTCCAG 1914 

I I I I I I II I I I I I III I If I I I I I I II II II I II I I I 
2743 CCCTGGCCTTGGGCTGG ACC AAC ATGC T C T ACT ACACCCGCGG T T TCC AG 2792 

1915 CACACAGGCATCTACAGTGTCATGATCCAGAAGGTCATCCTGCGGGACCT" 1964 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

27 93 CAGATGGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCT 2842 

1965 GCTGCGCTTCCTTCTGATCTACTTAGTCTTCCTTTTCGGCTTCGCTGTAG 2014 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
2843 GTGCCGTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAG 28 92 

2015 CCCTGGTGAGCCTGAGCCAGGAGGCTTGGCGCCCCGAAGCTCCTACAGGC 2064 

I I I I I I I I I I I I I I I I I I I I 1 

2893 CGGTGGTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAG 2942 

2065 CCCAATGCCACAGAGTCAGTGCAGCCCATGGAGGGACAGGAGGACGAGGG 2114 

III I III 11(111 II 

2943 TCCA CGTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCC 2980 

2115 CAACGGGGCCCAGTACAGGGGTATCCTGGAAGCCTCCTTGGAGCTCTTCA 2164 

I II I I I I I I I I I I I I I I I I I I I I I I 

2981 CCCCGATAGCTCCTACAACAGCCTGTACTCCACCTGCCTGGAGCTGTTCA 3030 

2165 AATTCACCATCGGCATGGGCGAGCTGGCCTTCCAGGAGCAGCTGCACTTC 2214 

I I I I I I II I I I I II I I I I I I I I I I I III I I I I I I I I I 

3031 AGTTCACCATCGGCATGGGCGACCTGGAGTTCACTGAGAACTATGACTTC 3080 

2215 CGCGGCATGGTGCTGCTGCTGCTGCTGGCCTACGTGCTGCTCACCTACAT 2264 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I II II 
3081 AAGGCTGTCTTCATCATCCTGCTGCTGGCCTATGTAATTCTCACCTACAT 3130 

2265- CCTGCTGCTCAACATGCTCATCGCCCTCATGAGCGAGACCGTCAACAGTG 2314 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3131 CCTCCTGCTCAACATGCTCATCGCCCTCATGGGTGAGACTGTCAACAAGA 3180 

2315 TCGCCACTGACAGCTGGAGCATCTGGAAGCTGCAGAAAGCCATCTCTGTC 2364 

I I I I II III II I I I I I I I I I I I I I I I I I I I I I I I I I II 

3181 TCGCACAGGAGAGCAAGAACATCTGGAAGCTGCAGAGAGCCATCACCATC 3230 



FIGURE 6 (cont'd) 



• • • . . 

2365 CTGGAGATGGAGAATGGCTATTGGTGGTGCAGGAAGAAG. . .CAGCGGGC 2411 

M I I I I I II I I I IN I I I II I I I I I I II I 

3231 CTGGACACGGAGAAGAGCTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTC 3280 

• * • • . 
2412 AGGTGTGATGCTGACCGTTGGCACTAAGCCAGATGGCAGCCCGGATGAGC 24 61 

Ml I I I I I I II II I I I I I I I I I I II II 

3281 AGGCAAGCTGCTGCAGGTGGGGTACACACCTGATGGCAAGGACGACTACC 3330 

• * • • • 

24 62 GCTGGTGCTTCAGGGTGGAGGAGGTGAACTGGGCTTCATGGGAGCAGACG 2511 

I I I I I I I I I I I I N I I I I I I I I I I I I I II I I I III I I 
3331 GGTGGTGCTTCAGGGtGGACGAGGTGAACTGGACCACCTGGAACACCAAC 3380 

2512 CTGCCTACGCTGTGTGAGGACCCG . . . TCAGGGGCAGGTGTCCCTCGAAC 2558 
II I I I I I I I I I I II I I I I I I I I I 

3381 GTGGGCATCATCAACGAAGACCCGGGCAACTGTGAGGGCGTCAAGCGCAC 3430 

2559 TCTCGAGAACCCTGTCCTG .... GCTTCCCCTCCCAAGGAGGATGAGGAT 2604 

II II III II II II II 

3431 CCTGAGCTTCTCCCTGCGGTCAAGCAGAGTTTCAGGCAGACACTGGAAGA 3480 

2605 GGTGCCTCTGAGGAAAACTATGTGCCCGTCCAGCTCCTCCAGTCCAACTG 2654 

i l I Ml M I ill ll II 

3481 ACTTTGCCCTGGTCCCCCTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAG 3530 

2655 ATGGCCCAGATGCAGCAGGAGGCCAGAGGACAGAGCAGAGGATCTTTCCA 2704 

I I I I I I I I I I I I I I I I I I I I I I 

3531 TCTGCTCAGCCCGAGGAAGTTTATCTGCGACAGTTTTCAGGGTCTCTGAA 3580 

2705 ACCACATCTGCTGGCTCTGGGGTCCCAGTGAATTCTGGTGGCAAATATAT 2754 

III I I I I I I I II I II I III I 

3581 GCCA GAGGACGCTGAGGTCTTCAAGAGTCCTGCCGCTTCCGGGGA 3625 

2755 ATTTTCACTAACTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 2804 

I I I I I I I I I I III III 

3626 GAAGTGAGGACGTCACGCAGACAGCACTGTCAACACTGGGCCTTAGGAGA 3675 

2805 AAAAA 2809 

3676 CCCCGTTGCCACGGGGGGCTGCTGAGGGAAGACCAGTGCTCTGTCAGCAG 3725 



FIGURE 6 (cont'd) 



CLUSTAL W (1.74) multiple sequence alignment 

humanVR2 MTS PSSS PVFfU&TLDGGQEDQS EADRGKLDFGSGLP PMESQFQOEDRKFAPQ I RVNLNY 
ratVR2 



humanVR2 
ratVR2 



RKGTQASQPDPNRFDRDRIiFHAVSRGVPEDIAGIiPEYI^KTSKYIiTDSEYTEGSTO 



human VR2 
ratVR2 



MKAVLNLKDGVNACI LPLI^IDRDSGNPQPLVNAQCTOD^ 



human VR2 
ratVR2 



I^VENGANVHARAOGRFFQKGQGTCFYFGEIiPI>SLAACTKQWDW 

- STHASAIjSIAACTKQWDVVTYLLENPHQPASLEA 



humanVR2 
ratVR2 



TDSQGNTVLHALVMISDNSAENIALVTSMY^ 
TDSI/3NTVLHAIjVMIADNSPENSALVIHMYIX5LiIjQMG 

*** ***********.***** *** ******* ***********.* * * ****** 



human VR2 
ratVR2 



humanVR2 
ratVR2 



AAKEGKIE I FRHI LQREFSG - I^HLSRKFTEWCYGPVRVSLYDLAS VDSCEENS VLE I IA 
AAKEGKIEIFRHILQREFSGPYQPLSRKFTEWCYGPVRVSL^ 

******************** # A************************ ********** 

raCKSPHRHRMVVIiEPLNKIJXJAKWDIiLIPKFFL^^ 
FHCKSPNRHRMVVLEPLNKliliQERWDRLVSRFFFNFA 

********************** *** *.,.**.** * A**************** * 



humanVR2 
ratVR2 



APHI*KAEVGNSMI*I,TGHILIIiI/^ 
IPSSKATFGBShlIiI*IiGHILILIjGGI 
* ** ****** A******************************************** 



humanVR2 
ratVR2 



LTWSQVIX^TJUEWYIiPIJ^VSAIiV.^^ 
LTVI*SQVLRFMETEWYI^LLVI^LVI/^^ 

******** *. ******** ************************************** 



humanVR2 
ratVR2 



LIYLVFLFGFAVALVSIjSQBAWRPEAPTGPI^ 
I*VYI*VFIjFX3FAVAXjVSLSREA^ 

********************* *.** . *.* • ** ***** * ********* 



humanVR2 
ratVR2 



humanVR2 
ratVR2 



humanVR2 
ratVR2 



ZiFKFTIGhfGEIAFQEQI£FRCa4^^ 
LFKFTIGNaEIAFOEQUtfTtt^^ 

A********************.*************}*************** ** ***** 

K10^KAISVI*I5MENaYWWCR 
KK^KATSVIjBHENGYHKCRRKICHRE^ 

******************** **j* * t*.***» **t***************j**:** 

PTXXXDPfXZIUGFVPRXXiENi^^ 

PTL6BDPSOPO XTONKKNPTSK- POK HSASEEDHLPIjQVLQSP 

***.**♦**♦*.. * I**. * * I. ****!! l*t*.*** 



FIGURE 7 



GAP of: ratvr2.pep check: 0 from: 1 to: 554 

ratVR2 Flrxbl47gli 

to: humanvr2.pep check: 5746 from: 1 to: 764 
humanVR2 Flh21ell 

Symbol comparison table: /usr/local/gcg_9.1/gcgcore/data/rundata/blosum62 
CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 2162 Length: 766 

Ratio: 3.939 Gaps: 4 

Percent Similarity: 81.703 Percent Identity: 79.167 

Match display thresholds for the alignment(s): 
| = IDENTITY 
: = 2 
. = 1 

ratvr2.pep x humanvr2.pep 



201 



STHASALS IAACTKQWDVVTYIjIiENPHQPAS LEATDSI/3NTVLH 

I I I I I I I I I I I I I - I I II f I I I I I I I ^ I I I I I I I I I 

GQGTCCTFGELPI^LAACrrKQWDVVSYItLENP^ 



45 ALVMIADNS PENSALVTHMYDGLLQMGARIjCPTVQIiEE ISNHQGLTPLKL 

1 1 1 1 1 * 1 1 1 II III 1 1 1 1 1 1 1 llllllllllhl I I MINI 

251 ALVMISDNSAENIALVTSMYDGLLQAGARI^ 
95 AAKEGKIE IFRHILQREFSGPYQPLSRKFTEWCYGPVRVSLYDLSSVDSW 

1 1 1 1 ! f 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f ( 1 1 f 1 1 1 

301 AAKEGKIEIFRHILQREFSG . LSHLSRKFTEWCYGPVRVSLYDIASVDSC 
145 EKKSVLEI IAFHCKS PNRHRMVVlUEPI/nCIjIiQEKWDRLVSRFFFNFA 

i * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 1 i 1 1 1 1 1 r 1 1 1 1 1 mi h hi ii i i 

350 EEHSVLEIIAFHCKSPHRHRMVVIjEPLNKL^ 

• • • • • 

X9S V KMFiyiVVAYHQPSI^PAIPSSKATFGBSMLLIiGm 

< 1 1 1 1 1 1 MIMM . II II I Mil 1 1 1 1 ■ 1 1 1 1 1 1 1 1 * I 

400 IYMFIFTAVAYHQPTIJqOQAAPHLKABVC^ 

245 QLKYFHRRRLFIHISFMDSYrailiFI^ 

llllllll -111111. Illllllll I I I I I I - I I I I l> lllllll 
450 QIAfOTHERKVFXWXSFXDSYTEXljFXiFQ^ 

• • . • • • 
295 VLSLVI/3WI2n^YYTRGFQOTG ITSVMIQKVIIj^LLRFLLVYLVFIjPOF 

I - 1 1 1 1 1 I 1 1 1 1 I I I I 1 1 I I 1 1 1 I I 1 1 II 1 1 1 1 1 I I I I 1 1 « I 1 1 1 1 1 I I 

500 VSM»VI«fIiRIJgYTM 

• • • • • 
345 AVALVSLSREARSPKAPEDKKSTVTTEQPTVGOHBE . . PAPYRS ILDASLE 

llllllll. II Ml 1*1 • II MM I Jl MtMII 
550 AVALVSLSQEAHRPEAPTOPNATESVQPHBGQH^ 



Row end positions (ratVR2/humanVR2): 44/250, 94/300, 144/349, 194/399, 244/449, 294/499, 344/549, 392/599 


FIGURE 8 



393 I*FKFTXGMaBIAFQBuI<R|^tenJUJj^ 44 
I I I III I I I III I I I I I Wl I I II I I I I I I II • I I I I I I I I I I I I I I 

600 LFKFTIGMaBIAFQEQUfFlR&V^^ €49 

• • • * . 

443 NHVADKS WS IWKLQKAISVZiQ^E^YKHCRRKKHREGRIiLKVGTRODOTP 492 

I II 11111111111111111111111 III I I :| llh II. I 
650 NSVATDSWSIWKLQKAISVLEMENGYWWC.RKKQRAGVMLTVGTKPDGSP 698 

493 DERWCFRVEBVNHAAWEKTLPTLSEDPSGPOITGNKKNPT SKPGKN 536 

I I I I I II I I I I I I I * I I . I I | | | | | | | | |: .|| I 

699 DERWCFRVEEVNWASWEQTLPTLCEDPSGAGVPRTLENPVLASPPKEDED 748 

539 SASEEDHLPLQVLQSP 554 

||||.>.|.|.||| 
749 GASEENYVPVQLLQSN . 764 



FIGURE 8 (cont'd) 



GAP of: humanvr1.seq check: 4554 from: 1 to: 3909 
humanVR1 Fbhl8547pat - Import - complete 

to: ratvr1.seq check: 7921 from: 1 to: 2847 

ratVR1.seq AF029310 in GenBank 

Symbol comparison table: 
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/nwsgapdna.cmp 
CompCheck: 8760 

Gap Weight: 50 Average Match: 10.000 

Length Weight: 3 Average Mismatch: 0.000 

Quality: 22717 Length: 3914 

Ratio: 7.979 Gaps: 10 

Percent Similarity: 82.125 Percent Identity: 82.125 

Match display thresholds for the alignment(s): 
| = IDENTITY 
: = 5 
. = 1 

humanvr1.seq x ratvr1.seq 
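
The Ratio line in a GAP header can be cross-checked against the Quality line. The sketch below assumes the usual GCG convention that Ratio is Quality divided by the length of the shorter sequence; the function name `gap_ratio` is hypothetical and is not part of GCG. The numbers are copied from the GAP headers printed with Figures 9 and 10.

```python
def gap_ratio(quality: float, len_a: int, len_b: int) -> float:
    """Return Quality divided by the length of the shorter sequence.

    Assumption: this mirrors how GCG GAP reports 'Ratio'; it is an
    illustration, not GCG code.
    """
    return quality / min(len_a, len_b)


# Figure 9 header: humanvr1.seq (3909 bases) x ratvr1.seq (2847 bases), Quality 22717
print(f"{gap_ratio(22717, 3909, 2847):.3f}")  # 7.979

# Figure 10 header: humanvr1.pep (839 aa) x ratvr1.pep (838 aa), Quality 3734
print(f"{gap_ratio(3734, 839, 838):.3f}")  # 4.456
```

Both results agree with the Ratio values printed in those two headers, which is a quick way to catch a misread digit in Quality or Ratio.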



1001 CCAGGCCGTAGATGCTCCCCGCCGGTCAGTCACTTAGTCGTCAGATCGCC 1050 

II I III 

1 CAGCTCCAAGGCACTTGCTCC 21 

1051 CGTCCTGGTATCACAGTGCTTCTGTTCAGGTTGCACACTGGGCCACAGAG 1100 
I I I I I II II I I I I I I I I I I I I I I I I I I I I I 

22 ATTTGGGGTGTGCCTGCACCT . . . AGCTGGTTGCAAATTGGGCCACAGAG 68 

1101 GATCCAGCAAGGATGAAGAAATGGAGCAGCACAGACTTGGGGACAGCTGC 1150 
MM I M I I I II I I I I I Ml Mill II III 
69 GATCTGGAAAGGATGGAACAACGGGCTAGCTTAGACTCAGAGGAGTCTGA 118 

1151 GGACCCACTCCAAAAGGACACCTGCCCAGACCCCCTGGATGGAGACCCTA 1200 
I I II II II I I II M M I M I II M I I II M II I II M 
119 GTCCCCACCCCAAGAGAACTCCTGCCTGGACCCTCCAGACAGAGACCCTA 168 

• • • • • 
1201 ACTCCAGGCCACCTCCAGCCAAGCCCCAGCTCCCCACGGCCAAGAGCCGC 1250 

III II II II II M II I M I II II II II III III III II 
169 ACTGCAAGCCACCTCCAGTCAAGCCCCACATCTTCACTACCAGGAGTCGT 218 

• • • * • 
1251 ACCCGGCTCTTTGGGAAGGGTGACTCGGAGGAGGCTTTCCCGGTGGATTG 1300 

I I I I I I I I I I I I I II || It I I I I I I M I I I I I II I II I II I II 
219 ACCCGGCTTTTTGGGAAGGGTGACTCGGAGGAGGCCTCTCCCCTGGACTG 268 

1301 CCCCCACGAGGAAGGTGAGTTGGACTCCTGCCCGACCATCACAGTCAGCC 1350 

III I II I M I II I I III II II I I II I MINI I I I I I I 
269 CCCTTATGAGGAAGGCGGGCTGGCTTCCTGCCCTATCATCACTGTCAGCT 318 

• • • * • 
1351 CTGTTATCACCATCCAGAGGCCAGGAGACGGCCCCACCGGTGCCAGGCTG 1400 

I I I I I I II I I I I I I I I I I I II II II II II III I I I I I I 
319 CTGTTCTAACTATCCAGAGGCCTGGGGATGGACCTGCCAGTGTCAGGCCG 368 



FIGURE 9 



• • • • ■ 

1401 CTGTCCCAGGACTCTGTCGCCGCCAGCACCGAGAAGACCCTCAGGCTCTA 1450 
I I II I I I I I I I Ml I I I I I I I I I I I III I I I I I I I 1 

369 TCATCCCAGGACTCCGTCTCCGCTGG...TGAGAAGCCCCCGAGGCTCTA 415 

1451 TGATCGCAGGAGTATCTTTGAAGCCGTTGCTCAGAATAACTGCCAGGATC 1500 
I I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I I 
416 TGATCGCAGGAGCATCTTCGATGCTGTGGCTCAGAGTAACTGCCAGGAGC 465 

1501 TGGAGAGCCTGCTGCTCTTCCTGCAGAAGAGCAAGAAGCACCTCACAGAC 1550 
I I I II I I I I I II I I f I I I I I I I II I I I I I I I I I I I I I III II III 
466 TGGAGAGCCTGCTGCCCTTCCTGCAGAGGAGCAAGAAGCGCCTGACTGAC 515 

1551 AACGAGTTCAAAGACCCTGAGACAGGGAAGACCTGTCTGCTGAAAGCCAT 1600 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

516 AGCGAGTTCAAAGACCCAGAGACAGGAAAGACCTGTCTGCTAAAAGCCAT 565 

1601 GCTCAACCTGCACGACGGACAGAACACCACCATCCCCCTGCTCCTGGAGA 1650 
I I I I f I I I I I I I I II I I I I I I I I I I I I I I II I f I I I I I I 
566 GCTCAATCTGCACAATGGGCAGAATGACACCATCGCTCTGCTCCTGGACG 615 

1651 TCGCGCGGCAAACGGACAGCCTGAAGGAGCTTGTCAACGCGAGCTACACG 1700 

I II III I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I " 

616 TTGCCCGGAAGACAGACAGCCTGAAGCAGTTTGTCAATGCCAGCTACACA 665 

1701 GACAGCTACTACAAGGGCCAGACAGCACTGCACATCGCCATCGAGAGACG 1750 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II 

666 GACAGCTACTACAAGGGCCAGACAGCACTGCAGATTGCCATTGAACGGCG 715 

1751 CAACATGGCCCTGGTGACCCTCCTGGTGGAGAACGGAGCAGACGTCCAGG 1800 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I 1 I I I I I I 
716 GAACATGACGCTGGTGACCCTCTTGGTGGAGAATGGAGCAGATGTCCAGG 765 

1801 CTGCGGCCCATGGGGACTTCTTTAAGAAAACCAAAGGGCGGCCTGGATTC 1850 
I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I III 
766 CTGCGGCTAACGGGGACTTCTTCAAGAAAACCAAAGGGAGGCCTGGCTTC 815 

1851 TACTTCGGTGAACTGCCCCTGTCCCTGGCCGCGTGCACCAACCAGCTGGG 1900 
I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I 
816 TACTTTGGTGAGCTGCCCCTGTCCCTGGCTGCGTGCACCAACCAGCTGGC 865 

• • * • * 
1901 CATCGTGAAGTTCCTGCTGCAGAACTCCTGGCAGACGGCCGACATCAGCG 1950 

I I I I I I I I I I I I I I I I Mil I I I I I I I I I I I I I I II I I I I I I I I I I 
866 CATTGTGAAGTTCCTGCTGCAGAACTCCTGGCAGCCTGCAGACATCAGCG 915 

• • * • . • 
1951 CCAGGGACTCGGTGGGCAACACGGTGCTGCACGCCCTGGTGGAGGTGGCC 2000 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I 1 I I I I I I I I II I I I 

916 CCCGGGACTCAGTGGGCAACACGGTGCTTCATGCCCTGGTGGAGGTGGCA 965 

• * • * • 
2001 GACAACACGGCCGACAACACGAAGTTTGTGACGAGCATGTACAATGAGAT 2050 

.' I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
966 GATAACACAGTTGACAACACCAAGTTCGTGACAAGCATGTACAACGAGAT 1015 

• • • ■ • • 
2051 TCTGATGCTGGGGGCCAAACTGCACCCGACGCTGAAGCTGGAGGAGCTCA 2100 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I III III 
1016 CTTGATCCTGGGGGCCAAACTCCACCCCACGCTGAAGCTGGAAGAGATCA 1065 



FIGURE 9 (cont'd) 



1 



2101 CCAACAAGAAGGGAATGACGCCGCTGGCTCTGGCAGCTGGGACCGGGAAG 2150 
I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | I | I I | | | | 



1066 CCAACAGGAAGGGGCTCACGCCACTGGCTCTGGCTGCTAGCAGTGGGAAG 1115 

2151 ATCGGGGTCTTGGCCTATATTCTCCAGCGGGAGATCCAGGAGCCCGAGTG 2200 

M 1 I I I I I I I M I I I I I I I I I I I I I I I I f I I I I I I I II I I I I I I I I 
1116 ATCGGGGTCTTGGCCTACATTCTCCAGAGGGAGATCCATGAACCCGAGTG 1165 

2201 CAGGCACCTGTCCAGGAAGTTCACCGAGTGGGCCTACGGGCCCGTGCACT 2250 

I I I I I I I I I I I I i I I I I I I M I I I MINIM I II I I I I M 1 I I 
1166 CCGACACCTATCCAGGAAGTTCACCGAATGGGCCTATGGGCCAGTGCACT 1215 

2251 CCTCGCTGTACGACCTGTCCTGCATCGACACCTGCGAGAAGAACTCGGTG 2300 

MM II II I I I I I I I I I I I I I I II I I I I II II I M I I I I I I I I 
1216 CCTCCCTTTATGACCTGTCCTGCATTGACACCTGTGAAAAGAACTCGGTT 1265 

2301 CTGGAGGTGATCGCCTACAGCAGCAGCGAGACCCCTAATCGCCACGACAT 2350 

M I I II I I I II II I I I I I I I I I III I I 1 I I I I I I I I II II I I I I I 
1266 CTGGAGGTGATCGCTTACAGCAGCAGTGAGACCCCTAACCGTCATGACAT 1315 

2351 GCTCTTGGTGGAGCCGCTGAACCGACTCCTGCAGGACAAGTGGGACAGAT 2400 

Ml I I I I I I II I I I II I I I I I I I I I I I I II I M I II I I I I || | 
1316 GCTTCTCGTGGAACCCTTGAACCGACTCCTACAGGACAAGTGGGACAGAT 1365 

• • • • • 
2401 TCGTCAAGCGCATCTTCTACTTCAACTTCCTGGTCTACTGCCTGTACATG 2450 

I M II I II I II I I I I I I I I I I I M I I I I I I I II I I I I I I I I I III 
1366 TTGTCAAGCGCATCTTCTACTTCAACTTCTTCGTCTACTGCTTGTATATG 1415 

2451 ATCATCTTCACCATGGCTGCCTACTACAGGCCCGTGGATGGCTTGCCTCC 2500 

I I I I M I I I I I I I I II I I I I II || I I I I I I I I I I I I I I I I I II 
1416 ATCATCTTCACCGCGGCTGCCTACTATCGGCCTGTGGAAGGCTTGCCCCC 1465 

2501 CTTTAAGATGGAAAAAA...TTGGAGACTATTTCCGAGTTACTGGAGAGA 2547 

II I I I I II I I I I I MM I II I II I I I M II I r I II II M I 
1466 CTATAAGCTGAAAAACACCGTTGGGGACTATTTCCGAGTCACCGGAGAGA 1515 

2548 TCCTGTCTGTGTTAGGAGGAGTCTACTTCTTTTTCCGAGGGATTCAGTAT 2597 

M I I II I I II I I I M M ! M 1 I I II I I M M I II II I I I II I I III 
1516 TCTTGTCTGTGTCAGGAGGAGTCTACTTCTTCTTCCGAGGGATTCAATAT 1565 

• • • • * 
2598 TTCCTGCAGAGGCGGCCGTCGATGAAGACCCTGTTTGTGGACAGCTACAG 2647 

I I I I I I I II I I I I I I I II I MM M I II I II II I II M I M I 
1566 TTCCTGCAGAGGCGACCATCCCTCAAGAGTTTGTTTGTGGACAGCTACAG 1615 

• • • • • 
2648 TGAGATGCTTTTCTTTCTGCAGTCACTGTTCATGCTGGCCACCGTGGTGC 2697 

M II I I II II M I I I I I I I II I I II II II I II M I Mill I 
1616 TGAGATACTTTTCTTTGTACAGTCGCTGTTCATGCTGGTGTCTGTGGTAC 1665 

• • • • • 
2698 TGTACTTCAGCCACCTCAAGGAGTATGTGGCTTCCATGGTATTCTCCCTG 2747 

M II I I I I II II I I I I I I I II II I II I I I I I I I I I I II II I M II II 
1666 TGTACTTCAGCCAACGCAAGGAGTATGTGGCTTCCATGGTGTTCTCCCTG 1715 

2748 GCCTTGGGCTGGACCAACATGCTCTACTACACCCGCGGTTTCCAGCAGAT 2797 

III I I II II II I I I I I I I I I I II I I I II I I II I If II I M I I I II I 
1716 GCCATGGGCTGGACCAACATGCTCTACTATACCCGAGGATTCCAGCAGAT 1765 



FIGURE 9 (cont'd) 



2798 GGGCATCTATGCCGTCATGATAGAGAAGATGATCCTGAGAGACCTGTGCC 2847 

MINIMUM I M I I I I I I | | | M I I I I I I I I I I I I I I I II I II I 
1766 GGGCATCTATGCTGTCATGATTGAGAAGATGATCCTCAGAGACCTGTGCC 1815 
* 

2848 GTTTCATGTTTGTCTACATCGTCTTCTTGTTCGGGTTTTCCACAGCGGTG 2897 

1 I I (UK I I I I I I I I I I MINIM II II II II II II I III 
1816 GGTTTATGTTCGTCTACCTCGTGTTCTTGTTTGGATTTTCCACAGCTGTG 1865 

2898 GTGACGCTGATTGAAGACGGGAAGAATGACTCCCTGCCGTCTGAGTCCAC 2947 

Mill I II I I II I. If I M II II II II II II I I I M II M II 

1866 GTGACACTGATTGAGGATGGGAAGAATAACTCTCTGCCTATGGAGTCCAC 1915 
* 

2948 GTCGCACAGGTGGCGGGGGCCTGCCTGCAGGCCCCCCGATAGCTCCTACA 2997 

I I I I I I M II I I I I II I I I | | | | | M | || Ml II I I 
1916 ACCACACAAGTGCCGGGGGTCTGCCTGCAAG . . . CCAGGTAACTCTTACA 1962 

2998 ACAGCCTGTACTCCACCTGCCTGGAGCTGTTCAAGTTCACCATCGGCATG 3047 

MINIMI! II I II II II II I I M I I M II II M I I II M II M M 
1963 ACAGCCTGTATTCCACATGTCTGGAGCTGTTCAAGTTCACCATCGGCATG 2012 

3048 GGCGACCTGGAGTTCACTGAGAACTATGACTTCAAGGCTGTCTTCATCAT 3097 

M II II I II II I II II I II I II I I II M II II II II If II II II I II II 
2013 GGCGACCTGGAGTTCACTGAGAACTACGACTTCAAGGCTGTCTTCATCAT 2062 

3098 CCTGCTGCTGGCCTATGTAATTCTCACCTACATCCTCCTGCTCAACATGC 3147 

I'M I II II I II II I I I II II M II If II It M II I II I II I II I I 
2063 CCTGTTACTGGCCTATGTGATTCTCACCTACATCCTTCTGCTCAACATGC 2112 

3148 TCATCGCCCTCATGGGTGAGACTGTCAACAAGATCGCACAGGAGAGCAAG 3197 

"Ml II II I II II II I II II I M I I I II II I II II I II I II I | | | 
2113 TCATTGCTCTCATGGGTGAGACCGTCAACAAGATTGCACAAGAGAGCAAG 2162 
* 

3198 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGACACGGAGAAGAG 3247 

I I M II II II I II I II J I I M M I II I II If II II II I II II I I II I I 
2163 AACATCTGGAAGCTGCAGAGAGCCATCACCATCCTGGATACAGAGAAGAG 2212 

• * • • 

3248 CTTCCTTAAGTGCATGAGGAAGGCCTTCCGCTCAGGCAAGCTGCTGCAGG 3297 

MINI II I I I M M M I M M I I II II II I I I I II I I I I I I I I II II 
2213 CTTCCTGAAGTGCATGAGGAAGGCCTTCCGCTCTGGCAAGCTGCTGCAGG 2262 

* • • * • 
3298 TGGGGTACACACCTGATGGCAAGGACGACTACCGGTGGTGCTTCAGGGTG 3347 

mm m mil mmiiii i i i i i i i i i i i i i i imiiiiii 

2263 TGGGGTTCACTCCTGACGGCAAGGATGACTACCGGTGGTGTTTCAGGGTG 2312 

3348 GACGAGGTGAACTGGACCACCTGGAACACCAACGTGGGCATCATCAACGA 3397 

HI Ml I I II I I M M II t II II II II M I I II I I I I I M I I I II I 
2313 GACGAGGTAAACTGGACTACCTGGAACACCAATGTGGGTATCATCAACGA 2362 
• 

3398 AGACCCGGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGC 3447 

Mill I III I II I I I I M I I I I I I M I II I I I I I I I I I I II II I I I I 
2363 GGACCCAGGCAACTGTGAGGGCGTCAAGCGCACCCTGAGCTTCTCCCTGA 2412 

3448 GGTCAAGCAGAGTTTCAGGCAGACACTGGAAGAACTTTGCCCTGGTCCCC 3497 

MMI II M II I I I I I I III M II II II I II M I I I I I I I I I III 
2413 GGTCAGGCCGAGTTTCAGGGAGAAACTGGAAGAACTTTGCCCTGGTTCCC 2462 
♦ 

3498 CTTTTAAGAGAGGCAAGTGCTCGAGATAGGCAGTCTGCTCAGCCCGAGGA 3547 



FIGURE 9 (cont'd) 



1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 II 

2463 CTTCTGAGGGATGCAAGCACTCGAGATAGACATGCCACCCAGCAGGAAGA 2512 

• • • * • 

3548 AGTTTATCTGCGACAGTTTTCAGGGTCTCTGAAGCCAGAGGACGCTGAGG 3597 

I I I I I III II I I I II II II I I I I I I I I I I I I I I I I I I 
2513 AGTTCAACTGAAGCATTATACGGGATCCCTTAAGCCAGAGGATGCTGAGG 2562 

3598 TCTTCAAGAGTCCTGCCGCTTCCGGGGAGAAGTGA.GGACGTCACGCAGA 3646 

I I I I I I I It I I I I I I I I I I I I I I I I I I I I I 

2563 TTTTCAAGGATTCCATGGTCCCAGGGGAGAAATAATGGACACTATGCAGG 2612 

3647 CAGCACTGTCAACACTGGGCCTTAGGAGACCCCGTTGCCACGGGGGGCTG 3696 

I I I I I III I I I I I I I 

2613 GATCAATG CGGGGTCTTTGGGTGGTCTG 2640 

3697 CTGAGGGAACACCAGTGCTCTGTCAGCAGCCTGGCCTGGTCTGTGCCTGC 3746 

f I I I I I I M I I lit II III I I I I II I I I 11 

2641 CTTAGGGAAC.CAGCAGGGTTGACGTTATCTGGGTCCACTCTGTGCCTGC 2689 

3747 CCA.GCATGTTCCCAAATCTGTGCTGGACAAGCTGTGGGAAGCGTTCTTG 3795 

I I I I I I I I I I II I III II I I II I I It I I 
2690 CT AGGC AC ATTCCT AGGACTTCGGCGGGCCTGCTGTGGGAA . CTGGGAGQ: 2738 

3796 GAAGCATGGGGAGTGATGTACATCCAACCGTCACTGTCCCCAAGTGAATC 3845 

I I I I I I I I I I I I I I I I I I 'I I I I I I I 

2739 TGTGTGGGAATTGAGATGTGTATCCAACCATGA...TCTCCAAACATTTG 2785 

3846 TCCTAACAGACTTTCAGGTTTTTACTCACTTTACTAAAAAAAAAAAAAAA 3895 

II I I I I I II II I I I I I I I I III 

2786 GCTTTCAACTCTTTATGGACTTTATTAAACAGAGTGAATGGCAAATCTCT 2835 

3896 AGGGCGGCCGCTTA 3909 

I III 
2836 ACTTGGACACAT . . 2847 



FIGURE 9 (cont'd) 
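
The nucleotide rows of Figure 9 and the protein rows of Figure 10 can be tied together by translating codons. The following is a minimal sketch using the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative only and do not come from GCG. The example input is the stretch of humanvr1.seq beginning at the ATG in the row numbered 1101.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, usable, 3))


# First ten codons of the human VR1 open reading frame (Figure 9, row 1101).
print(translate("ATGAAGAAATGGAGCAGCACAGACTTGGGG"))  # MKKWSSTDLG
```

The result matches the first ten residues of the humanvr1.pep row in Figure 10, which makes this a convenient spot check when a base in a scanned row is in doubt.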



GAP of: humanvr1.pep check: 6877 from: 1 to: 839 

humanVR1_Fbhl8547pat - fchrb87a6, 3909 bases, 4554 checksum, 
to: ratvr1.pep check: 5764 from: 1 to: 838 

ratVR1 | AF029310 Rattus norvegicus vanilloid receptor subtype 1 mRNA, complete cds. 

Symbol comparison table: 
/ddm_local/gcg/gcg_9.1/gcgcore/data/rundata/blosum62.cmp 
CompCheck: 6430 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 3734 Length: 840 

Ratio: 4.456 Gaps: 3 

Percent Similarity: 89.247 Percent Identity: 86.022 

Match display thresholds for the alignment(s): 

| = IDENTITY 
: = 2 
. = 1 

humanvr1.pep x ratvr1.pep 

1 MKKWSSTDLGTAADPLQKDTCPDPLDGDPNSRPPPAKPQLPTAKSRTRLF 50 

I ... I I . I I ... I I I I I I I : I I I I I : I : I I I I I I 
1 MEQRASLDSEESESPPQENSCLDPPDRDPNCKPPPVKPHIFTTRSRTRLF 50 

51 GKGDSEEAFPVDCPHEEGELDSCPTITVSPVITIQRPGDGPTGARLLSQD 100 

I I I I I I I I I . I I I : I I I I III I I I I 1:111111111 I III 
51 GKGDSEEASPLDCPYEEGGLASCPIITVSSVLTIQRPGDGPASVRPSSQD 100 



101 SVAASTEKTLRLYDRRSIFEAVAQNNCQDLESLLLFLQKSKKHLTDNEFK 150 

I I. I M M I I II I I I : I I I I . I I I : I I I I I I I I : I I I I I I . I I I 
101 SVSAG.EKPPRLYDRRSIFDAVAQSNCQELESLLPFLQRSKKRLTDSEFK 149 

151 DPETGKTCLLKAMLNLHDGQNTTIPLLLEIARQTDSLKELVNASYTDSYY 200 

I I I I I I I I I I I I I I I I I . I I I II II I : : I I . I I I I I : I I I I I I I I I I 
150 DPETGKTCLLKAMLNLHNGQNDTIALLLDVARKTDSLKQFVNASYTDSYY 199 



201 KGQTALHIAIERRNMALVTLLVENGADVQAAAHGDFFKKTKGRPGFYFGE 250 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I . I I I I I I I I I I I I I I I I I 
200 KGQTALHIAIERRNMTLVTLLVENGADVQAAANGDFFKKTKGRPGFYFGE 249 

251 LPLSLAACTNQLGIVKFLLQNSWQTADISARDSVGNTVLHALVEVADNTA 300 

I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
250 LPLSLAACTNQLAIVKFLLQNSWQPADISARDSVGNTVLHALVEVADNTV 299 

301 DNTKFVTSMYNEILMLGAKLHPTLKLEELTNKKGMTPLALAAGTGKIGVL 350 

I I I I II I I I I I I I I . I I I I I I M I I I I I : I I : I I : I I I I I I I -111111 

300 DNTKFVTSMYNEILILGAKLHPTLKLEEITNRKGLTPLALAASSGKIGVL 349 

351 AYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 400 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

350 AYILQREIHEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI 399 



FIGURE 10 



401 AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFLVYCLYMIIFT 450 

M I I I I I I I I I I I I I I I I I | | | | | | | | | || | | | | | | I | | I I I I I | | | | I 
400 AYSSSETPNRHDMLLVEPLNRLLQDKWDRFVKRIFYFNFFVYCLYMIIFT 449 

• • . . . 

451 MAAYYRPVDGLPPFKMEK.IGDYFRVTGEILSVLGGVYFFFRGIQYFLQR 499 

1111111:1111:1:. : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
450 AAAYYRPVEGLPPYKLKNTVGDYFRVTGEILSVSGGVYFFFRGIQYFLQR 499 

• a > • . 

500 RPSMKTLFVDSYSEMLFFLQSLFMLATVVLYFSHLKEYVASMVFSLALGW 549 

I I I : I - I M II I I I . I II . I I I I I I .MINI I I I I I I M I f I i : I I 
500 RPSLKSLFVDSYSEILFFVQSLFMLVSVVLYFSQRKEYVASMVFSLAMGW 549 

• • • • • 

550 TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYIVFLFGFSTAVVTLI 599 

I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I : I I II I I I I I I I II I 
550 TNMLYYTRGFQQMGIYAVMIEKMILRDLCRFMFVYLVFLFGFSTAVVTLI 599 

600 EDGKNDSLPSESTSHRWRGPACRPPDSSYNSLYSTCLELFKFTIGMGDLE 649 

I I I I I . I I I III I : II II: I . I I I I I I I I I I I I I I I I I I M I I I 
600 EDGKNNSLPMESTPHKCRGSACK.PGNSYNSLYSTCLELFKFTIGMGDLE 648 

650 FTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQESKNIWK 699 

I I I I I I I I II I II I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
649 FTENYDFKAVFIILLLAYVILTYILLLNMLIALMGETVNKIAQESKNIWK 698 

700 LQRAITILDTEKSFLKCMRKAFRSGKLLQVGYTPDGKDDYRWCFRVDEVN 749 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I : I I I I I I II I I I I I I I I I I 
699 LQRAITILDTEKSFLKCMRKAFRSGKLLQVGFTPDGKDDYRWCFRVDEVN 748 

750 WTTWNTNVGIINEDPGNCEGVKRTLSFSLRSSRVSGRHWKNFALVPLLRE 799 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I . I I I I I I I I I I I : 

749 WTTWNTNVGIINEDPGNCEGVKRTLSFSLRSGRVSGRNWKNFALVPLLRD 798 

800 ASARDRQSAQPEEVYLRQFSGSLKPEDAEVFKSPAASGEK 839 

II III . I III I : : . I I II I I I I I I I I III 
799 ASTRDRHATQQEEVQLKHYTGSLKPEDAEVFKDSMVPGEK 838 



FIGURE 10 (cont'd) 
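
The Percent Identity reported in a GAP header can be illustrated on a single pair of aligned rows. The sketch below assumes that identity is counted over aligned columns in which neither sequence carries a gap character (`.` in GAP output); the helper name `percent_identity` is hypothetical. The two rows are the human 351-400 and rat 350-399 rows of Figure 10, which differ at a single position (Q versus H).

```python
def percent_identity(row_a: str, row_b: str, gap: str = ".") -> float:
    """Percent of non-gap aligned columns where both rows have the same residue."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns where neither sequence has a gap.
    columns = [(a, b) for a, b in zip(row_a, row_b) if a != gap and b != gap]
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


human = "AYILQREIQEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI"  # human 351-400
rat = "AYILQREIHEPECRHLSRKFTEWAYGPVHSSLYDLSCIDTCEKNSVLEVI"    # rat 350-399
print(percent_identity(human, rat))  # 98.0
```

Applying the same function across all rows of an alignment, summing matches and columns before dividing, would give a whole-alignment figure comparable to the header value, under the stated assumption about gap handling.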



CLUSTAL W (1.74) multiple sequence alignment 



humanVR2 .alt 
humanVR2 



MTSPSSSPVFRI*ETUXXK)EIX?SEADRGKU>FGSGLPPMBSQF\& 



humanVR2.alt 
humanVR2 



RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 



human VR2 .alt 
humanVR2 



MKAVLNLIOXWNACILPLLQIDRDSGNPQPIA^ 



human VR2 .alt 
human VR 2 



-- - - GRFFQKGQGTCFYFX3EI/PLSLAACTRQWDWS YLiLENPHQPAS LQA 

IO,VENGANVHARACGRFFX3KGQGTCFYFGELPI^IJVACTKQTO 



human VR2 .alt 
humanVR2 



TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
************************************************************ 



human VR2 .alt 
human VR2 



AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 
AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 



human VR2 .alt 
human VR2 



HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA 
HCKSPHRHRMVVLEPLNTCLIXiAKWDLLIPKF 



humanVR2 . alt 
humanVR2 



PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYF^ 
PHLKAEVGNSMLLTGHILILI/XSIYI^VGQI^^ 



humanVR2.alt 
humanVR2 



TWSQVLCFLAI EWYLP LLVSALVLG WLNLLYYTRGFQHTG I YS VMI Q 

TVVSQVLCFTAIEira«PIiVSALVLGWI2I^ 



humanVR2.alt 
humanVR2 



rYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGA 



humanVR2.alt 
humanVR2 



. FKFTIGMGKLAFQEQLHFRGMVLLLL^ 



humanVR2.alt 
humanVR2 



— lOCAISVLEMENGTfWKCRKIOQRAGVMLTVGTKPDC^ 

ioaKzixsvi£MENGYfctrciuenQ^^ 



humanVR2.alt 
humanVR2 



IXEDPSGtoaVPRTLENPVLASPPKEDEDGASEENYVFVQLI^ 
LCEDPSGAGVPRTLENPVIjAS PPKEDEDGASEENYVPVQI^^ 

Alpha, Regions - Chou-Fasman 
Beta, Regions - Garnier-Robson 
Beta, Regions - Chou-Fasman 
Turn, Regions - Garnier-Robson 
Hydrophilicity Plot - Kyte-Doolittle 
Hydrophobicity Plot - Hopp-Woods 
Beta, Amphipathic Regions - Eisenberg 
Antigenic Index - Jameson-Wolf 
Surface Probability Plot - Emini 

Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains 

hmmpfam - search a single seq against HMM database 
HMMER 2.1.1 (Dec 1998) 
Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL). 
HMM file: /prod/ddm/seqanal/PFAM/pfam4.2/Pfam 
Sequence file: /usr/ns-home/docs/seqanal/orfanal/oa-script.18670.seq 
Query: hVR-1 

Scores for sequence family classification (score includes all domains): 
Model    Description    Score    E-value    N 
ank      Ank repeat     51.5     1.9e-11 

Parsed for domains: 
Model    Domain    seq-f    seq-t    hmm-f    hmm-t          score    E-value 
ank      1/3       201      233      1        33     []      34.4     2.6e-06 
ank      2/3       248      283      1        33     []      13.2     2 
ank      3/3       333      361      1        33     []      3.4      26 

Alignments of top-scoring domains: 
ank: domain 1 of 3, from 201 to 233: score 34.4, E = 2.6e-06 

                   *->nGnTPLHlAarygnvevvklLLehGAdvnartk<-* 
                      +G+T+LH+A +  n+ +v lL+e+GAdv a+ 
       hVR-1   201    KGQTALHIAIERRNMALVTLLVENGADVQAAAH    233 

ank: domain 2 of 3, from 248 to 283: score 13.2, E = 2 

                   *->nGnTPLHlAarygnvevvklLLe...hGAdvnartk<-* 
                       G  PL lAa ++++ +vk+LL+++ + Ad+ ar+ 
       hVR-1   248    FGELPLSLAACTNQLGIVKFLLQnswQTADISARDS    283 

ank: domain 3 of 3, from 333 to 361: score 3.4, E = 26 

                   *->nGnTPLHlAarygnvevvklLLehGAdvnartk<-* 
                      +G TPL lAa +g++ v ++ L+ + + 
       hVR-1   333    KGMTPLALAAGTGKIGVLAYILQ    REIQEP    361 



FIGURE 13 
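
The "Parsed for domains" rows of an hmmpfam report are easier to compare once they are read into records. The sketch below assumes whitespace-separated rows in the column order given by the report header (Model, Domain, seq-f, seq-t, hmm-f, hmm-t, score, E-value). `DomainHit`, `parse_domain_row` and the E-value cutoff of 1e-3 are hypothetical choices made for illustration and are not part of HMMER. The sample rows carry the values printed for the ank domains of hVR-1.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    model: str
    index: int      # domain number, e.g. 1 in "1/3"
    total: int      # domain count, e.g. 3 in "1/3"
    seq_from: int
    seq_to: int
    score: float
    evalue: float


def parse_domain_row(row: str) -> DomainHit:
    """Parse one whitespace-separated 'Parsed for domains' row."""
    fields = row.split()
    index, total = (int(part) for part in fields[1].split("/"))
    # Score and E-value are always the last two fields on the row.
    return DomainHit(fields[0], index, total, int(fields[2]), int(fields[3]),
                     float(fields[-2]), float(fields[-1]))


rows = [
    "ank 1/3 201 233 1 33 [] 34.4 2.6e-06",
    "ank 2/3 248 283 1 33 [] 13.2 2",
    "ank 3/3 333 361 1 33 [] 3.4 26",
]
hits = [parse_domain_row(row) for row in rows]

# Illustrative cutoff only: keep domains with E-value below 1e-3.
significant = [hit for hit in hits if hit.evalue < 1e-3]
print([(hit.seq_from, hit.seq_to) for hit in significant])  # [(201, 233)]
```

With this cutoff only the first repeat (residues 201 to 233) stands on its own, while the sequence-level score of 51.5 reflects all three domains together.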

Alpha, Regions - Garnier-Robson 
Alpha, Regions - Chou-Fasman 
Beta, Regions - Garnier-Robson 
Beta, Regions - Chou-Fasman 
Turn, Regions - Garnier-Robson 
Turn, Regions - Chou-Fasman 
Coil, Regions - Garnier-Robson 
Hydrophilicity Plot - Kyte-Doolittle 
Alpha, Amphipathic Regions - Eisenberg 
Beta, Amphipathic Regions - Eisenberg 
Flexible Regions - Karplus-Schulz 
Antigenic Index - Jameson-Wolf 
Surface Probability Plot - Emini 

Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains 

hmmpfam - search a single seq against HMM database 
HMMER 2.1.1 (Dec 1998) 
Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL). 
HMM file: /prod/ddm/seqanal/PFAM/pfam3.3/Pfam 
Sequence file: /tmp/orfanal.579-aa 

Scores for sequence family classification (score includes all domains): 
Model    Description           Score    E-value 
ank      PF00023 Ank repeat    53.7     4e-12 

Parsed for domains: 
Model    Domain    seq-f    seq-t    hmm-f    hmm-t          score    E-value 
ank      1/3       162      194      1        33     []      38.3     1.7e-07 
ank      2/3       208      243      1        33     []      6.4      4.3 
ank      3/3       293      328      1        33     []      8.8      2.1 

Alignments of top-scoring domains: 

ank: domain 1 of 3, from 162 to 194: score 38.3, E = 1.7e-07 
] ~ ♦->«GcfrPIjHlJLarygnVevVfc^ * 

? 4G++4XS+* «M- 4+MVkUH<*^OT««r 

- Flh21ell 162 rVOTSMfiTJVTEKRJSlW 194 

ank: domain 2 of 3, from 208 to 243: score 6.4, E = 4.3 

♦->nGo3^rglAaTy guvuvv1 cllJie« . ihGAdvnartfcc-* 

G CTf lAa 4- «flli04-f «• A* 

Fih2iQii 208 pcmrgijgrjwa w f ^ : 243 

ank: domain 3 of 3, from 293 to 328: score 8.8, E = 2.1 

»^>neaicralAary yutfiawiit gliliq> * «faQftdgnart«-« 

FUh21oll 293 QQaT TOIganKimtro 328 




>hVR2.altFL (full-length predicted) 

MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 
MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 

QREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAFHCKSPHRHRMVVLE 

PLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAAPHLKAEVGNSMLLT 

GHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALLTVVSQVLCFLAIEW 

YLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQKKAISVLEMENGYWWCRKKQRAGVML 

TVGTKPDGSPDERWCFRVEEVNWASWEQTLPTLCEDPSGAGVPRTLENPVLASPPKEDED 
GASEENYVPVQLLQSN 



CLUSTAL W (1.74) multiple sequence alignment 



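MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
************************************************************ 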

RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 

RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 
************************************************************ 

MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 
MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 
************************************************************ 

LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 

LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 
************************************************************ 

TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
************************************************************ 

AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 

AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 
************************************************************ 

HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA 

HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA 
************************************************************ 

PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL 
PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL 
************************************************************ 

TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLL 

TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQK 

******************************* ****************** 

IYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLEL 
FKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLN 



KAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 

* **4k*.* ***** ********************************************** 

LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 

LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
******************************************** 



